Overview

Dataset Statistics

Number of Variables 9
Number of Rows 20640
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 1.4 MB
Average Row Size in Memory 72.0 B
Variable Types
  • Numerical: 9
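
The row-size and total-size figures above can be checked with simple arithmetic. The sketch below assumes that each of the 9 numerical columns is stored as a 64-bit float (8 bytes per value) and that "MB" is the binary unit (2**20 bytes); neither is stated in the report. The per-variable Memory Size values (330240, i.e. 16 per row) are not explained by this assumption and are left aside here.

```python
# Figures copied from the Dataset Statistics block.
n_rows = 20640
n_cols = 9

# Assumption: every numerical column is a 64-bit float (8 bytes per value).
bytes_per_value = 8

row_bytes = n_cols * bytes_per_value   # bytes held by one row
total_bytes = n_rows * row_bytes       # bytes held by the whole table
total_mib = total_bytes / 2**20        # assumption: "MB" means 2**20 bytes

print(f"row size: {row_bytes} B")
print(f"total: {total_bytes} B = {total_mib:.1f} MB")
```

Under these assumptions the result matches the reported 72.0 B per row and 1.4 MB in total.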

Dataset Insights

AveRooms is skewed
AveBedrms is skewed
Population is skewed
AveOccup is skewed
Latitude is skewed
Longitude is skewed
Longitude has 20640 (100.0%) negatives

Variables


MedInc

numerical

Approximate Distinct Count 12928
Approximate Unique (%) 62.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 330240 B
Mean 3.8707
Minimum 0.4999
Maximum 15.0001
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • MedInc is skewed right (γ1 = 1.6465)

Quantile Statistics

Minimum 0.4999
5-th Percentile 1.6006
Q1 2.5634
Median 3.5348
Q3 4.7432
95-th Percentile 7.3003
Maximum 15.0001
Range 14.5002
IQR 2.1799
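
Range and IQR are derived from the other quantile rows: Range = Maximum - Minimum and IQR = Q3 - Q1. A minimal check using the rounded MedInc values printed above; the last digit of the IQR can differ from the report because the report presumably works from unrounded quartiles.

```python
# Quantile values reported for MedInc (already rounded to 4 decimals).
minimum, q1, q3, maximum = 0.4999, 2.5634, 4.7432, 15.0001

value_range = maximum - minimum   # spread of the full data
iqr = q3 - q1                     # spread of the middle 50% of rows

print(f"range = {value_range:.4f}")
print(f"IQR   = {iqr:.4f}")
```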

Descriptive Statistics

Mean 3.8707
Standard Deviation 1.8998
Variance 3.6093
Sum 79890.6495
Skewness 1.6465
Kurtosis 4.951
Coefficient of Variation 0.4908
  • MedInc is not normally distributed (p-value 0.007919741942067812)
  • MedInc has 681 outliers
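
Several descriptive statistics are functions of the others: Variance = (Standard Deviation)^2, Coefficient of Variation = Standard Deviation / Mean, and Sum = Mean x Number of Rows. The sketch below recomputes them from the rounded MedInc values, so small differences from the reported figures are expected.

```python
# Values reported for MedInc, plus the row count from the overview.
n_rows = 20640
mean, std = 3.8707, 1.8998

variance = std ** 2      # reported: 3.6093
cv = std / mean          # reported: 0.4908
total = mean * n_rows    # reported: 79890.6495

print(f"variance = {variance:.4f}")
print(f"CV       = {cv:.4f}")
print(f"sum      = {total:.1f}")
```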

HouseAge

numerical

Approximate Distinct Count 52
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 330240 B
Mean 28.6395
Minimum 1
Maximum 52
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • HouseAge is skewed right (γ1 = 0.0603)

Quantile Statistics

Minimum 1
5-th Percentile 8
Q1 18
Median 29
Q3 37
95-th Percentile 52
Maximum 52
Range 51
IQR 19

Descriptive Statistics

Mean 28.6395
Standard Deviation 12.5856
Variance 158.3963
Sum 591119
Skewness 0.06033
Kurtosis -0.8007
Coefficient of Variation 0.4394
  • HouseAge is not normally distributed (p-value 3.198980878617318e-05)

AveRooms

numerical

Approximate Distinct Count 19392
Approximate Unique (%) 94.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 330240 B
Mean 5.429
Minimum 0.8462
Maximum 141.9091
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • AveRooms is skewed right (γ1 = 20.6964)

Quantile Statistics

Minimum 0.8462
5-th Percentile 3.4323
Q1 4.4407
Median 5.2291
Q3 6.0524
95-th Percentile 7.6402
Maximum 141.9091
Range 141.0629
IQR 1.6117

Descriptive Statistics

Mean 5.429
Standard Deviation 2.4742
Variance 6.1215
Sum 112054.5547
Skewness 20.6964
Kurtosis 879.14
Coefficient of Variation 0.4557
  • AveRooms is not normally distributed (p-value 4.179085647533791e-24)
  • AveRooms has 511 outliers
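
Skewness (γ1) and Kurtosis are moment-based, which is why a few extreme values such as the AveRooms maximum can dominate them. The sketch below uses the plain population-moment formulas; whether the report applies a bias correction is not stated, so this is an assumption. The negative Kurtosis values elsewhere in the report imply that excess kurtosis (normal distribution = 0) is being reported. The `sample` list is hypothetical toy data, not taken from the dataset.

```python
from statistics import fmean

def skew_and_excess_kurtosis(values):
    """Return (g1, g2): population skewness and excess kurtosis."""
    n = len(values)
    mu = fmean(values)
    # Central moments of order 2, 3 and 4.
    m2 = sum((x - mu) ** 2 for x in values) / n
    m3 = sum((x - mu) ** 3 for x in values) / n
    m4 = sum((x - mu) ** 4 for x in values) / n
    return m3 / m2 ** 1.5, m4 / m2 ** 2 - 3.0

# Hypothetical toy data: typical room counts plus one extreme value.
sample = [4.4, 5.2, 6.1, 5.0, 4.8, 5.5, 141.9]
g1, g2 = skew_and_excess_kurtosis(sample)
print(f"skewness = {g1:.3f}, excess kurtosis = {g2:.3f}")
```

A single large value is enough to push both statistics well above zero, which is the pattern seen for AveRooms, AveBedrms and AveOccup.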

AveBedrms

numerical

Approximate Distinct Count 14233
Approximate Unique (%) 69.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 330240 B
Mean 1.0967
Minimum 0.3333
Maximum 34.0667
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • AveBedrms is skewed right (γ1 = 31.3147)

Quantile Statistics

Minimum 0.3333
5-th Percentile 0.9391
Q1 1.0061
Median 1.0488
Q3 1.0995
95-th Percentile 1.273
Maximum 34.0667
Range 33.7333
IQR 0.09345

Descriptive Statistics

Mean 1.0967
Standard Deviation 0.4739
Variance 0.2246
Sum 22635.3751
Skewness 31.3147
Kurtosis 1636.3152
Coefficient of Variation 0.4321
  • AveBedrms is not normally distributed (p-value 9.708519870086214e-23)
  • AveBedrms has 1424 outliers

Population

numerical

Approximate Distinct Count 3888
Approximate Unique (%) 18.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 330240 B
Mean 1425.4767
Minimum 3
Maximum 35682
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Population is skewed right (γ1 = 4.9355)

Quantile Statistics

Minimum 3
5-th Percentile 348
Q1 787
Median 1166
Q3 1725
95-th Percentile 3288
Maximum 35682
Range 35679
IQR 938

Descriptive Statistics

Mean 1425.4767
Standard Deviation 1132.4621
Variance 1.2825e+06
Sum 2.9422e+07
Skewness 4.9355
Kurtosis 73.535
Coefficient of Variation 0.7944
  • Population is not normally distributed (p-value 3.2126712720360756e-18)
  • Population has 1196 outliers

AveOccup

numerical

Approximate Distinct Count 18841
Approximate Unique (%) 91.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 330240 B
Mean 3.0707
Minimum 0.6923
Maximum 1243.3333
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • AveOccup is skewed right (γ1 = 97.6325)

Quantile Statistics

Minimum 0.6923
5-th Percentile 1.8725
Q1 2.4297
Median 2.8181
Q3 3.2823
95-th Percentile 4.3334
Maximum 1243.3333
Range 1242.641
IQR 0.8525

Descriptive Statistics

Mean 3.0707
Standard Deviation 10.386
Variance 107.87
Sum 63378.3225
Skewness 97.6325
Kurtosis 10648.4303
Coefficient of Variation 3.3824
  • AveOccup is not normally distributed (p-value 4.226520428405932e-25)
  • AveOccup has 711 outliers
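
The report does not say how outliers are counted. One common convention is Tukey's rule, which flags values outside [Q1 - 1.5 x IQR, Q3 + 1.5 x IQR]; the sketch below applies that rule to the AveOccup quartiles as an illustration only, and is not confirmed to be the rule behind the count above. `tukey_fences` is a hypothetical helper name.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) outlier fences for the given quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr

# Quartiles reported for AveOccup.
low, high = tukey_fences(2.4297, 3.2823)
print(f"lower fence = {low:.4f}, upper fence = {high:.4f}")
```

Under this assumed rule the upper fence lies between the 95-th percentile (4.3334) and the maximum (1243.3333), so the flagged rows would make up less than 5% of the data on the high side.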

Latitude

numerical

Approximate Distinct Count 862
Approximate Unique (%) 4.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 330240 B
Mean 35.6319
Minimum 32.54
Maximum 41.95
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Latitude is skewed right (γ1 = 0.4659)

Quantile Statistics

Minimum 32.54
5-th Percentile 32.82
Q1 33.93
Median 34.26
Q3 37.71
95-th Percentile 38.96
Maximum 41.95
Range 9.41
IQR 3.78

Descriptive Statistics

Mean 35.6319
Standard Deviation 2.136
Variance 4.5623
Sum 735441.62
Skewness 0.4659
Kurtosis -1.1178
Coefficient of Variation 0.05995
  • Latitude is not normally distributed (p-value 4.733293899569894e-12)

Longitude

numerical

Approximate Distinct Count 844
Approximate Unique (%) 4.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 330240 B
Mean -119.5697
Minimum -124.35
Maximum -114.31
Zeros 0
Zeros (%) 0.0%
Negatives 20640
Negatives (%) 100.0%
  • Longitude is skewed left (γ1 = -0.2978)

Quantile Statistics

Minimum -124.35
5-th Percentile -122.47
Q1 -121.8
Median -118.49
Q3 -118.01
95-th Percentile -117.08
Maximum -114.31
Range 10.04
IQR 3.79

Descriptive Statistics

Mean -119.5697
Standard Deviation 2.0035
Variance 4.0141
Sum -2.4679e+06
Skewness -0.2978
Kurtosis -1.3301
Coefficient of Variation -0.01676
  • Longitude is not normally distributed (p-value 4.000582473597246e-07)
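
The negative Coefficient of Variation for Longitude follows from dividing the standard deviation by a negative mean; it does not indicate negative dispersion. A minimal sketch, with the absolute-value variant shown as one possible alternative rather than anything the report computes:

```python
# Values reported for Longitude.
mean, std = -119.5697, 2.0035

cv = std / mean            # sign follows the mean; reported: -0.01676
cv_abs = std / abs(mean)   # alternative: magnitude of relative spread

print(f"CV = {cv:.5f}, |CV| = {cv_abs:.5f}")
```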

MedHouseVal

numerical

Approximate Distinct Count 3842
Approximate Unique (%) 18.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 330240 B
Mean 2.0686
Minimum 0.15
Maximum 5
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • MedHouseVal is skewed right (γ1 = 0.9777)

Quantile Statistics

Minimum 0.15
5-th Percentile 0.662
Q1 1.196
Median 1.797
Q3 2.6472
95-th Percentile 4.8981
Maximum 5
Range 4.85
IQR 1.4512

Descriptive Statistics

Mean 2.0686
Standard Deviation 1.154
Variance 1.3316
Sum 42695.0406
Skewness 0.9777
Kurtosis 0.3275
Coefficient of Variation 0.5579
  • MedHouseVal is not normally distributed (p-value 0.00025972617137478725)
  • MedHouseVal has 1071 outliers

Interactions

Correlations

Missing Values